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Efficiency in foreign exchange markets 



Abstract 

In this paper we test the efficiency hypothesis in financial market. A market 
is called efficient if the price variations "fully reflect" relevant information, i.e. 
a speculator cannot make a profit out of it. A currency exchange market is a 
natural candidate to check efficiency because of its high liquidity. We perform a 
statistical study of weak efficiency in Deutschemark/US dollar exchange rates using 
high frequency data. In the weak form of efficiency the information can only come 
from historical prices. 

The presence of correlations in the returns sequence implies the possibility of a 
statistical prevision of market behavior. We show the existence of correlations by 
means two statistical tools. A first analysis has been performed using structure 
functions. This approach gives an indication on the returns distributions at 
different lags r. We have also computed the generalized correlation functions of 
the return absolute values; roughly speaking this is a test of the independence of 
the fluctuations of fixed size. In both cases we have obtained a clear evidence of long 
term return anomalies. This implies a failure of the usual "random walk" model 
of the returns; nevertheless the presence of long term correlations does not directly 
imply the fault of the weak efficiency hypothesis : it is not obvious how to use time 
correlation to make a profit in a realistic investment. 

Then we show how this information is relevant for a speculator. First we 
introduce a measure of the available information relevant from a financial point 
of view, with a technique which reminds the Kolmogorov e-entropy. Second in the 
case of no transaction costs, we propose a simple investment strategy which leads 
to an exponential growth rate of the capital related to the available information. 

We have performed two kind of information analysis in the return series. We 
show that the available information is practically zero if the speculator wants to 
change his portfolio systematically after a fixed lag r : for him the market is efficient. 
Instead, a finite available information is observed by a patient investor who cares 
only of fluctuation of given size A. This is the first far as we know, in which 

the available information obtained by a suitable data analysis is directly linked to 
the possible earnings of a speculator who follows a particular trading rule. 



1 Introduction 

A large amount of research suggests that prices are related with information, and in 
particular it focuses on efficiency in financial markets. A market is inefficient if a 
speculator can make a profit out of information present in the market. Since the celebrated 
work of Fama |lj a big effort has been done to test empirically and to understand 
theoretically the efficiency of financial markets. 
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A market is said to be efficient if prices "fully reflect" all available information, i.e. such 
information is completely exploited in order to determine the price, after having taken 
into account the costs to use this information and a transient time, due to costs, to reach 
equilibrium. The idea is that the investor destroys information while using it and as a 
consequence he contributes to produce equilibrium. 

In the last years long term correlations have been observed in financial markets. We 
shall not review in details the contributions to the field. We stress that long term return 
anomalies are usually revealed via test of efficiency in a semi-strong form, i.e. not only 
considering the asset prices but also some other publicly known news. The interest 
is generally focused on the market reactions to an event occurred a fixed period time 
before (three to five typically) such as divested firms ||, mergers or initial public 
offerings 0, [|] . Recent research |], [7], [], |TD|, |TT[| has pointed out the existence of long 
range correlations also in the weak form. However only low frequency data are considered 
and implications on efficiency are not completely understood. 

In this paper we focus on efficiency in the weak form, i.e we consider only the information 
coming from historical prices. We are interested on a time scale longer than the typical 
correlation returns time (few minutes) but lower than the characteristic time after which 
we do not have statistical relevance of the results: in this sense we deal with long term 
return anomalies. Currency exchange seems to be the natural subject for an efficiency 
test. We expect that such markets are very efficient as a consequence of the large liquidity. 
For these reasons we have decided to analyze a one year high frequency dataset of the 
Deutschemark/US dollar exchange, the most liquid market. Our data, made available 
by Olsen and Associated, contains all worldwide 1,472,241 bid-ask Deutschemark/US 
dollar exchange rate quotes registered by the inter-bank Reuters network over the period 
October 1, 1992 to September 30, 1993. 

One of the main problem in tick data analysis, is the irregular spacing of quotes. In this 
paper we consider business time, i.e. the time of the transaction given by its rank in the 
sequence of quotes. This seems to be a reasonable way to consider time in a worldwide 
time series, where time delays and lags of no transaction are often due to geographical 
reasons. 

In this paper we test the independence hypothesis of returns and define and measure 
an available information. In section 2 we check the independence with two different 
techniques. The first one, called structure functions analysis, shows whether it is possible 



to rescale properly the distribution functions at different lags (T^|. The second one is a 
direct independence test. The independence of two random variables x, y implies that 
f(x) and g(y) are uncorrelated for every / and g. We check it for /(•) = g(-) = | ■ \ q . We 
interpret these quantities as an estimate of the correlation between returns of given size. 
We want to quantify the available information and discuss its financial relevance. In 
section 3 we consider a speculator with a given resolution, i.e. he is concerned only about 
fluctuations at least of size A. This reminds the e entropy introduced by Kolmogorov [|1^ 
in the context of information theory. A similar filter has been first introduced by 
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Alexander |T|, |15| . To show the inefficiency of the market he proposed the following 
trading rule : if the return moves up of A, buy and hold until it goes down of A from a 
subsequent high, then sell and maintain the short position till the return rises again of A 
above a subsequent low. 

Here we divide the problem in two parts. First we define the available information for 
any fixed resolution A of the speculator. Second, as suggested by Fama we relate 
the available information with the profitability by means of a particular trading rule. We 
show how this information is related to the optimal growth rate portfolio using a simple 
approximation in terms of Markov process. 
In section 4 we summarize and discuss the results. 



2 Long term correlations 



After the seminal work of Bachelier |16| , it was widely believed that the price variations 
follow an independent, zero mean, gaussian process. The main implications of the 
"fundamental principle" of Bachelier are that the price variation is a martingale and 
it is an independent random process. 

Bachelier considers the market a "fair game" : a speculator cannot exploit previous 
information to make better predictions of forthcoming events. Information can come only 
from correlations and in absence of them from the shape of the probability distribution 
of the returns. 

For about sixty years this contribution was practically forgotten, and quantitative analysis 
on financial data started again with advent of computers. 

Following Fama ]]J , we shall call hereafter "random walk" the financial models where the 
returns 

are independent variables. In this paper we define St as the average between bid and ask 
price. We do not want to enter here in a detailed analysis of the huge literature about 
"random walk" models. We just mention that, before the contribution of Mandelbrot [17] , 



the return r t was considered well approximated by an independent gaussian process. 
Mandelbrot proposed that the returns were distributed according a Levy-stable, still 
remaining independent random variables. 
At present, it is commonly accepted that the variables 

r t (r) = £ r, = ln%21 (2) 
t'=t+i 

do not behave according a gaussian at small r, while the gaussian behavior is recovered for 
large r. Of course a return r t distributed according to a Levy, as suggested by Mandelbrot, 
is stable under composition and then also would follow the same distribution for every 
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t. A recent proposal is the truncated Levy distribution model introduced by Mantegna 



and Stanley [|T1| which fits well the data and reproduces the transition from small to large 
r. 

Let us focus our attention on independence tests. We remark once again that an influence 
of the return r t at time t on the return r t+T at time t + r implies a not fully efficient market 
in a weak form. The relevance of the question is clear in the case of an investor analyzing 
historical data to a make market forecast and a profit out of it. 

As a test of independence it is generally considered the correlation functions on time 
intervals r 

C(r) = (r t r t+T )-(r t )(r t+T ) , (3) 



where (■) denotes the temporal average 



1 T 



1 t=i 

and T is the size of the sample. 

The presence of correlations in Deutschemark/US dollar exchange returns before the 
nineties is a well known fact. For example in [fL9| , where it is considered the same 
dataset we use, it is shown that the returns are negatively correlated for about three 
minutes. 

We remind that in general uncorrelation does not imply independence. A sort of long 
term memory can be revealed with appropriate tools, see for example the seminal works 



in the field of Alexander JT4], pl| and Niederhoffer and Osborne , and the most recent 
literature ||, 0, |8|, |9|, |2l], p2"| , where it is shown that absolute returns or powers of returns 
exhibit a long range correlation. It is a common belief that it is not possible to exploit 
this kind of information because of transaction costs. 

We shall show in next section that dependent (even if uncorrelated) returns have a clear 
financial meaning because they imply the existence of available information. 
In subsection 2.1 we show the persistence of a long range memory for the 
Deutschemark/US dollar exchange rate by means of the analysis of structure functions. 
In subsection 2.2, we test directly the independence of returns with a generalization of 
the correlation analysis. 

2.1 Structure functions 

There is some evidence that the process rj cannot be described in terms of a unique 
scaling exponent p3|, |2l, i.e. it is not possible to find a real number h such that the 



statistical properties of the new random variable jr h do not depend on r. 
The scaling exponent h gives us information on the features of the underlying process. In 
the case of independent gaussian behavior of r t the scaling exponent is 1/2. 
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On the contrary, the data show that the probability distribution function of 
r[ T ^ / \jVar[r[ T ^] changes with r [^, |24j. This is an indication that r t is a dependent 
stochastic process and it implies the presence of wild fluctuations. 

A way to show these features, which is standard for the fully developed turbulence 
theory p5] , is to study the structure functions : 

F q {T) = {\rP\«). (4) 

In the simple case where r t is an independent random process, one has (for a certain range 
of r) 

F q {r) ~ T h « , (5) 

where h < 1/2 in the Levy-stable case while the gaussian behavior is recovered for h = 1/2. 
The truncated Levy distribution corresponds to h < 1/2 for r sufficiently small and to 
h = 1/2 at large r. "Random walk" models present always a unique scaling exponent. If 
the structure function has the behavior in (|5|) we call the process self-affine (sometimes 
called uni-fractal). 




Figure 1: Structure functions Mog 2 F g (r) versus log 2 r for Deutschemark/US dollar 
exchange rate quotes. The three plots correspond to different value of q : q = 2.0 (o), 
q = 4.0 (□) and q = 6.0 (+). In the insert we show £ q versus q. We estimate with linear 
regression two different regions in this graph. The first one is a line of slope 0.5 (dashed 
line), and the second has a slope 0.256 (dash dotted line). 



As previously mentioned a description in terms of a unique scaling exponent h, does not 
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work. Therefore instead of (j^) one has 

F q (r) ~ r 



(6) 



where £ q are called scaling exponents of order q. If £ q is not linear, the process is called 
multi-affine (sometimes multi-fractal). Using simple arguments it is possible to see that 
£ g has to be a convex function of q J26|. The larger is the difference of £ q from the linear 
behavior in q the wilder are the fluctuations and the correlations of returns. In this sense 
the deviation from a linear shape for £ 9 gives an indication of the relevance of correlations. 
In figure [3] we plot, the F q {r) for three different values of q. A multi-affine behavior is 
exhibited by different slopes of -log 2 (F q ) vs. log 2 (r), at least for r between 2 4 and 2 15 . 
For larger business lags a spurious behavior can arise because of the finite size of the 
dataset considered. In the insert we plot the £ 9 estimated by standard linear regression 
of log 2 Fg(r) vs. log 2 (r) for the values of r mentioned before. To give an estimation 
of errors, the most natural way turns out to be a division of the year dataset in two 
semesters. This is natural in the financial context, since it is a measure of reliability of 
the second semester forecast based on the first one. We observe that the traditional stock 
market theory (brownian motion for returns), gives a reasonable agreement with £ g ~ q/2 
only for q < 3, while for q > 6 one as £ g ~ hq + b with h = 0.256 and c = 0.811. We stress 
once again that such a behavior cannot be explained by a "random walk" model (or other 
self-affine models) and this effect is a clear evidence of correlations present in the signal. 



2.2 Long term correlations analysis 

Let us consider the absolute returns series which is often shown to be long range 

correlated in recent literature || [7|, ||, |, |10|, [□], |2l], [22j. Absolute values mean that we 
are interested only in the size of fluctuations. 
Let us introduce the generalized correlations C q (r): 

C q (T) = (\r^\r t+T \^-(\r t \^(\r t+T \^ . (7) 

We shall see that the above functions will be a powerful tool to study correlations of 
returns with comparable size: small returns are more relevant at small q, while C q {j) 
is dominated by large returns at large q (the usual definition of correlation for absolute 
returns is recovered for q = 1). 

Following the definitions in J27|, let us suppose to have a long memory for the absolute 
returns series, i.e. the correlations C q (r) approaches zero very slowly at increasing r, i.e. 
C q {r) is a power-law: 

C q {r) ~ t-P* . 

If \r t \ q is an uncorrelated process one has f3 q — 1, while f3 q less than 1 corresponds to long 
range memory. 

Instead of directly computing correlations C q {r) of single returns we consider rescaled 
sums of returns. This is a well established way, if one is interested only in long term 
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analysis, in order to drastically reduce statistical errors that can affect our quantities |28| 



Let us introduce the generalized cumulative absolute returns jllj, [TT 

Xt, q (r) = -J2\r t+ ^ (8) 

and their variance 

S q (r) = {xM 2 ) - { Xt , q (r)) 2 . (9) 

After some algebra (see Appendix), one can show that if C q (r) for large r is a power-law 
with exponent (3 q , then b~ q (r) is a power-law with the same exponent : 



C q { T ) ~ r-h =>► 5, 



In other words the hypothesis of long range memory for absolute returns (/3 q < 1), can be 
checked via the numerical analysis of S q (r). 




og 2 r 



Figure 2: log 2 S q versus log 2 r. The three plots correspond to different value of q : q — 1.0 
(o), q — 1.8 (□) and q = 3.0 (+). In the insert we show f3 q versus q, the horizontal line 
shows value (3 = 1 corresponding to independent variable. 



In figure |]we plot the S q vs. t in log-log scale, for three different values of q. The variance 
S q (r) is affected by small statistical errors, and it confirms the persistence of a long range 
memory for a r larger than 2 4 and up to 2 15 . 

The exponent [3 q can be profitably estimated by standard linear regression of log 2 (5 g (r)) 
versus log 2 (T), and the errors are estimated in the same way of subsection 2.1. 



9 



We notice in the insert that the "random walk" model behavior is remarkably different 
from the one observed in the Deutschemark/US dollar exchange for q < 3. This implies 
the presence of strong correlations, while one has f3 q — 1 for large values of q, i.e. big 
fluctuations are practically independent. 

An intuitive meaning of the previous results is the following. Using different q one selects 
different sizes of the fluctuations. Therefore the non trivial shape of f3 q is an indication of 
the existence of long term anomalies. 

3 Available information 

Let us focus our attention on information analysis of the return r t . We must treat the 
dataset in such a way that methods of information theory can be applied. 
The usual approach is the codification of the original data in a symbolic sequence. There 
are several ways to build up such a sequence: one should make sure that this treatment 
does not change the structure of the process underlying the evolution of the financial data. 
In order to construct a symbolic sequence from a time series, at least two steps are needed : 

• A filtering procedure to remove most of the noise in the dataset. 

• A coarse graining procedure to partition the range of variability of the filtered data, 
in order to assign a conventional symbol to each element of the partition. 

The codification is then straightforward: a symbol corresponds unambiguously to the 
data stored in each element of the partition. 

From the original signal r t we obtain a discrete symbolic sequence : 



where each q takes only a finite number, say m, of values. In such a way we reduce ourself 
to the study of a discrete stochastic process. 

A simple way to obtain a symbolic sequence is to consider only a two- valued symbol and 
define a discrete random variable without performing any filtering operation : 



The financial meaning of this codification is rather evident: the symbol —1 occurs if the 
stock price decreases, otherwise the symbol is 1. 

Let us now remind some basic concepts of information theory. Consider a sequence of n 
symbols C n = {ci, c 2 , . . . , c n } and its probability p(C n ). The block entropy H n is defined 



Ci,C 2 , ...,Q,... 




(10) 



by 



H n = -^p(C n )lnp(C n ) . 
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The difference 

h n = H n+ i — H n (12) 

represents the average information needed to specify the symbol c n+ i given the previous 
knowledge of the sequence {c\, C2, . . . , c n }. 

The series of h n is monotonically not increasing and for an ergodic process one has 

h = lim h n (13) 



where h is the Shannon entropy |29 



It is easy to show that if the stochastic process {ci, C2, . . .} is markovian of order k (i.e. the 
probability to have c n at time n depends only on the previous k steps n— 1, n — 2, . . . , n—k), 
then h n = h ioT n > k. In other cases, or h n goes to zero for increasing n which means that 
either for n sufficiently large the (n + l)th-symbol is predictable knowing the sequence C n 
or it tends to a positive finite value. The maximum value of h is ln(ra). It occurs if the 
process has no memory at all and the m symbols have the same probability. 
The difference between ln(m) and h is intuitively the quantity of information we may use 
to predict the next result of the phenomenon we observe, i.e. the market behavior. We 
define available information : 

I = ln(m) - h = R\n(m) (14) 
where R — 1 — hj ln(m) is called the redundancy of the process [|2£f] . 



Hereafter we limit the discrete process to take only two values, —1 and 1 which have an 
evident financial meaning. We expect that the high frequency details are not relevant 
and cannot be easily used by financial analysts. It seems rather reasonable to study the 
process r t with a finite lag r (see subsection 3.1) or a finite resolution A on the values of 
r t (see subsection 3.2). 

In the following we shall show that different discretization procedures lead to completely 
different results. This corresponds to two different kind of investment, one systematic 
and the other patient. The systematic investor modifies his portfolio every r steps, where 
the lag t is measured in the usual business time (but the same results hold also for the 
calendar time). The patient investor, instead, waits to update his strategy until a certain 
behavior of the market is achieved, for example, a fluctuation of size A. 
In the last part of this section we shall show that this kind of investment seems to be the 
most suitable for financial aims. 



3.1 A naive approach: fixed lag analysis 

In a recently proposed time series model the price variation is considered as a result 
of a true underlying process plus an uncorrelated white noise. If we think the observed 
return as the sum of these two components, it is natural to try to eliminate the additional 
noise taking the average of the signal over a given lag. 
More precisely we treat the financial data as follows : 
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we group the sequence of the r t in non overlapping blocks of r data and we define 
the sum of the data in the j th block 



j{r+l) 

' 3 = 



fc=JT+l 

Notice that r'- is equivalent to r^) where is defined in equation (Q). 

• we decimate the data, i.e. we take from the sequence r'- only one data every m : 

Rk = r mk ■ 

This procedure should eliminate the eventual short time correlation of the noise. 

• we build up the symbolic sequence, like in equation ([TQ[) : 

/ -1 if R k <0 
Ck = { +1 if R k > ■ (15) 

Let us remark that the first two steps have been performed to reduce the noise. 
The total number of the data of the symbolic sequence is N/(mr), where N is the 
number of original data. This filter is linear, i.e. if the signal is a linear combination of 
various contributes, at the end of the filtering procedure we have the sum of the filtered 
contributes. The theory of linear filter is well developed in literature (see for example 
f28l), and we use this simple approach to check whether a noise is added to our signal. 
At this point we have a binary sequence from which we compute the Shannon entropy. 
Figure ||| reports the results of our analysis for the linear filter. We plot h n vs n for various 
t and m, compared with the entropy of Bernoulli trials with probability p = 0.5 (this is 
nothing that the usual coin tossing). 

We know that the entropy h n of a fair binary Bernoulli trial must be ln(2) for every n. 
The folding of h n at large n depends on the finite number of sequence elements. It can be 
proved that the statistical analysis does not give the proper value of h n for n larger 



than : ^ 

n* w - ln(JV) 
h 

where h is the entropy of the signal and N is the length of the sequence. 
It should be now clear that the entropy of the sequence is given by the value of the plateau. 
The entropy does not differ sensibly from ln(2), of the coin tossing, and, therefore, 
we cannot make prevision on the market. In conclusion, the financial data cannot be 
represented as a white noise added on a true underlying signal. 

Nevertheless, because of the long term correlations (see section 2), there is a clear 
indication that the present state of the market depends non trivially on the past. 
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Figure 3: h n versus n. The three plots correspond to different value of r and m : 
r = 10, m = 10 (□), r = 10, m = 100 (o) and r = 100, m = 100 (+). We show also the 
entropy numerically obtained from a coin tossing sequence with the same number of data 
of the case r = 10, m — 10 (•). The dotted line indicates ln(2). 

3.2 A fixed resolution analysis 

The failure of the previous analysis lead us to try another approach in order to keep the 
information present in the financial data, this time we use a non-linear filter with a clear 
financial meaning. 

The procedure to create the symbolic sequence is now : 
• we fix a resolution value A and we define 



where to is the initial business time, and t > t . We wait until an exit time t\ such 



In this way we consider only market fluctuations of amplitude A. Since the 
distribution of the returns is almost symmetric, the threshold A has been chosen 
equal for both positive and negative values. Starting from S tl we obtain with the 
same procedure S t2 . 




(16) 



as : 



r *i,*o 



> A . 
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following the previous prescription we create a sequence of returns 

V tl,to) r t2,ti ) • ■ • j r t k ,t k _ 1 i ■ ■ •} ) 

from which we obtain the symbolic dynamics : 



Cfc 



-i if n k ,t k ^ < o 
+1 if n k ,t k ^ > o 



(17) 



We define k as A trading time, i.e. we enumerate only the transactions at which A 
is reached. 
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Figure 4: Evolution of r tk ,t k _ 1 with A = 0.01. to = corresponds at 00:00:14 of October 
1, 1992 in calendar time to the , and the t 4 = 9939 corresponds at 11:59:28 of October 2, 
1992. 



Let us notice that the variable \rt k ,t k _ 1 \ has a narrow distribution close to the threshold, 
and for all practical purposes |rt fc tfc l | can be well approximated with A. In figure f| we 
show an example of evolution of the r tk ^ k _ Y . 

The entropy analysis of the symbolic sequence {7't fc ,t fc _ 1 } gives a completely different result 
from the one in the previous section. In figure [5] it is shown that the entropy is clearly 
different from ln(2) in a wide range of A, i.e. there is a set of A for which the available 
information (see eq. ([14])) is very large. 

In figure || we plot the available information versus A and the distribution of transaction 
costs. Because these two quantities do not have similar size they are plotted on different 
vertical scales but they are superimposed to make easier comparison between them. We 
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Figure 5: h n versus n. The three plots correspond to different value of A : A = 0.00005 
(o), A = 0.0002 (□) and A = 0.004 (+). The dotted line indicates ln(2). 



observe that the maximum of the available information is almost in correspondence to the 
maximum of the distribution of the transaction cost. 
We have estimated the transaction costs 7 as 

j g(ask) g(ask) (i(bid) 

7t = 2 ^ n g(bid) — 2S ( ~ bid ^ > ' 
of course this is an upper bound for the true transaction cost. 

We notice that the available information is almost equal zero when we consider very small 
and very large values of A. These limit values cannot be reached for two different reasons; 
since S t can assume only discrete values, it is not possible to take the limit A — > 0. In 
addition, we cannot compute J A for large A because in the sequence r tfcjtfc _ 1 there are not 
enough data for an efficient statistical analysis. 

3.3 Profitable information 

We focus our attention on optimal strategies in a financial market with non zero available 
information. We then show the economic relevance of such a quantity in case of weak 
efficiency of the market. 

Consider the optimal growth rate strategy for a patient speculator. The returns {^t fe ,t fc _ 1 } 
are almost symmetrically distributed and they can be well approximated by the two 
threshold values A and —A. 
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Figure 6: Available information versus A (on the left), superimposed to the distribution 
of transaction costs, P{^f) versus 7 (on the right). 



We shall only deal with the markovian case. In fact as suggested by figure [| and shown 
in [32], one has that the markovian approximation well reproduces the signal filtered with 
a fixed resolution A. The symmetry of the return distribution and the markovian nature 
of the process implies that the transition matrix is close to 

f/ A 1_ M • (18) 

V 1 - Pa Pa J 

where p A is the probability to have +1 at A trading time (t + 1), knowing that q was +1 
at time t. 

In this particular case the available information is : 

4 = p A lnfej + (1 - p A ) ln(l - p A ) + ln(2) . (19) 

We focus our attention on an investor who decides to diversify his portfolio only in a 
security asset with a given interest rate return r, and to invest, every A trading time t, 
a fraction l t of his capital in the Deutschemark/US dollar exchange. Our convention is 
that the fraction I is positive if he exchange dollars into marks, negative vice versa, and 
we allow the speculator to borrow money from a bank. 

We assume a vanishing interest rate return. This hypothesis is reasonable: in the period 
we are dealing with, the official discount rate fixed by the Federal Reserve is of 3 percent 
per year and fluctuates between 5.75 and 8.75 percent in the German case. The patient 
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speculator rehedges his portfolio on average every 66 seconds when A is equal to the mean 
transaction cost. The largest A corresponds to an average time of 8.6 hours of standby. In 
the time scales involved the true interest rate return is about one hundred times smaller 
than A: the approximation of a vanishing interest rate appears to be fair. 
We deal with the no transaction costs case: this will allow us to understand easily the 
meaning of available information for a patient investor. The more general situation with 
transaction costs is treated in detail in p3 |. 



Let us focus on the investment at time t: the speculator commits a fraction l t in dollars. 
At the following time step t + 1 his capital becomes 

W t+1 = (l+l t c t+1 A)W t . (20) 

We notice that, a consequence of the symmetry, is that the optimal It can assume only 
two values l t = c t l where / is a real number. 

We define the profitable information as the exponential rate of the capital of an investor 
who follows an optimal growth rate strategy. The strict connection between this quantity 
and the available information was first noticed by Kelly ||34j| , who, considering an 



elementary gambling game, first gave an interpretation of Shannon entropy in the context 
of optimal investment. 



The computation of capital growth rate is a simple application of ||34|| , and for the 
investment above described is 

X A (l) = lim ±ln^=p A ln(l + IA) + (l-pJln(l-iA) . (21) 
It reaches its maximum for 

I* = Pa ~ . (22) 

An intuitive consequence of equation ( p2|) is that an anti-persistent return (p A < 1/2), 
as in the financial series we have considered, implies that the optimal strategy is to buy 
marks if the dollar rises, and to do the opposite otherwise. Of course a persistent case 
(p A > 1/2) would imply an l t greater than zero every time the positive threshold A is 
reached. 

From (^I|) and ( P2"D one has that the optimal growth rate is equal to the available 
information: 

= maxA A (Z) = p A lnfej + (1 - p A ) ln(l - p A ) + ln(2) = I A . (23) 

We stress that the equivalence between available and profitable information, if we forget 
the costs involved in this trading rule, means that a speculator, who follows a particular 
strategy, has the possibility to obtain a growth rate of his capital exactly equal to this 
information: this makes clear why we have called it profitable. 

We underline that we have considered the growth rate measuring the time in A trading 
time. To obtain the exponential rate of the capital in the usual calendar time we have 
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to normalize (|21|) with the average exit time for the specified A For example for 

A = 0.0002 corresponding to the maximum available information is characterized by 
(r) = 66 seconds. This means that the average optimal growth rate is equal to 0.27 
percent per second. 

A naive consequence of previous results could be that an efficient market hypothesis should 
be rejected. 

Unfortunately (for the authors) this is not obvious. 

We have previously noticed that the available information can be transformed in profitable, 
let us now comment the feasibility of the proposed trading rule. 

When A is near the value of the maximum available information, the speculator changes 
his position with high frequency, and A is comparable with transaction costs: it is not 
any more possible to neglect them. 

Furthermore in equation ( p2|) A appears at the denominator, and then the values of I* 
can be enormous. For example for A corresponding to the maximum of the available 
information, the speculator who follows the optimal growth rate strategy, should borrow 
2830 times the capital he has! Even a small fluctuation from the expected average behavior 
can lead to bankruptcy. 

On the other hand if he wants to use reasonable values of I*, he has to chose a sufficiently 
large A; in this situation the filtered series is almost indistinguishable from a "random 
walk" and then there is almost no available information. 

We have now all the ingredients to comment the shape of the available information shown 
in figure |6]. 

The speculator cannot have a resolution A lower than the transaction costs, profits from 
such an investment would be in fact less than costs. Therefore in this range of A the 
available information increases. The discretization of the prize changes does not allow 
for reaching in a continuous way A = 0, where the "random walk" model is practically 
recovered as shown in the first part of this section. 

For A larger than the transaction costs the information can be exploited by proper 
strategies. However, small fluctuations are more difficult to detect and to distinguish 
from the "noise" and the profitable information is almost useless because of the huge 
values of I* involved. This fact is even more evident when transaction costs are included. 
On the other hand for large A, the investors are able to discover the available information 
and to make it profitable with a feasible strategy. As a consequence, the efficient 
equilibrium is than restored for all practical purposes. 

Let us briefly mention what happens instead to the systematic investor. We can repeat 
exactly the above discussion and the only difference is that now he decides to modify 
his portfolio every r business time. Because there is almost no available information 
(see subsection 3.1) the optimal growth rate of his capital is vanishing even without 
considering the costs involved in the transactions. 
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4 Conclusions 



In this paper we have considered the long term anomalies in the Deutschemark/US dollar 
quotes in the period from October 1, 1992 to September 30, 1993 and we have analyzed 
the consequences on the weak efficiency of this market. 

In section 2 we have shown the presence of long term anomalies with two techniques: the 
structure functions and a generalization of the usual correlation analysis. In particular we 
have pointed out that "random walk" models (or other self-affine models) cannot describe 
these features. 

Once we have shown the existence of correlations in financial process, we have tested 
whether they allow for a profitable strategy. 

With such a goal in mind, in section 3 we have first introduced a direct measure of the 
available information, then we have shown in a particular case that this is equivalent to 
a profitable information. In other words following a suitable trading rule it is possible in 
absence of transaction costs to have an exponential growth rate of the capital equal to 
this information. 

We have measured the available information with a technique which reminds the 
Kolmogorov e entropy. Two different codifications for financial series (fixed lag r and 
finite resolution A) lead to completely different results. 

The available information strongly depends on the kind of investment the speculator has 
in mind. We show that if he wants to change his position systematically at fixed lags r 
the available information is practically zero: for this investor the market is efficient. 
Instead, a patient investor, who waits to modify his portfolio till the asset has a fluctuation 
A, observes a finite available information. 

However, the existence of such a trading rule does not imply that the investment is feasible 
in practice. Namely we show that when reasonable investments are involved almost no 
available information survives. On the contrary, it is extremely difficult to use it when it 
is still present. 

The technique described here can be considered as a powerful tool to test weak efficiency : 
the speculator contributes to reach efficient equilibria destroying the available information 
that could be exploited in practice. The efficiency hypothesis is then restored for almost 
all practical purposes. 

Appendix 

In this appendix we show that if the correlations C q (r) exhibit a long range memory 
C q {j) ~ r _/3q then also the variance S q (r) of the generalized cumulative absolute returns 
{Xt, q ij)} behaves at large r as r _/3 «. 
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Making explicit expression of Xt, q ( T ) (see equation (H)) one can write equation (|9|) as : 

w = i £ Edn+nrin+rj 9 ) - <in +T1 n<ir i+r2 H . 

r t 1= Ot 2 =0 

Taking into account the fact that r t is a stationary process, and using the definition of 
C q (r), one has: 

8 q (r) = -C q (0) + - 2 £ ~ T*) 

T T 

' T>T 1 >T 2 >0 

where 

C q (0) = (\n\ 2q ) - (\r t \<) 2 . 
The expression of 5 q (r) can be rewritten as: 

5 q (r) = -C q (0) + - 2 (r - ri ) C 3 (rO . 

71 = 1 

Under the hypothesis C g (r) ~ r~^, one has for large r 

T 71 = 1 

which leads to : 

(5 g (r) = 0(r- 1 ) + 0(r^) . 
Since (3 q < 1, the thesis follows, i.e. : 

<J ff (r) ~ r"^ . 
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